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Abstract 

Background: Questionnaires are valuable for population surveys of mental health. Different survey instruments may 
however give different results. The present study compares two mental health instruments, the Major Depression 
Inventory (MDI) and the Mental Health Inventory (MHI-5), in regard to their prediction of long-term sickness 
absence. 

Method: Questionnaire data was collected from N =4153 Danish employees. The questionnaire included the MDI 
and the MHI-5. The information of long-term sickness absence was obtained from a register. We used Cox 
regression to calculate covariance adjusted hazard ratios for long-term sickness absence for both measures. 

Results: Both the MDI and the MHI-5 had a highly significant prediction of long-term sickness absence. A one 
standard deviation change in score was associated with an increased risk of long-term sickness absence of 27% for 
the MDI and 37% for the MHI-5. When both measures were included in the same analysis, the MHI-5 performed 
best. 

Conclusion: In general population surveys, the MHI-5 is a better predictor of long-term sickness absence than 
the MDI. 

Keywords: Head-to-head comparison, Survey instrument, Questionnaire 



Background 

Mental disorder is a main cause of the disease burden in 
developed countries [1]. Thus, policy makers have an 
interest in the level of mental health and prevalence of 
mental health problems in the population. Clinical inter- 
views are the gold standard for measuring psychiatric 
morbidity, however such interviews are costly. An alter- 
native is self-report questionnaires of mental health. 
These questionnaires are cheaper and practical for popu- 
lation surveys and they can provide an assessment of all 
respondents. Questionnaire data on mental health has 
been shown to be associated with reduced workability 
[2,3], lower work performance, and sickness absence 
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[4-7]. Questionnaires can, however, not find true preva- 
lence but only estimate risk proportions [8]. 

Several mental health instruments exist and depending 
on the chosen questionnaire the estimate of the risk pro- 
portion of the population may be very different: For ex- 
ample, in Croatia, Rukavinia et al. used the Mental Health 
Inventory (MHI-5) and estimated that the proportion with 
'psychological distress' (scored from 0 to 100, cut-off at 52 
point) was 34% in the adult population [9]. In Canada 
the proportion with 'high psychological distress' was 38% 
using the K-10 scale (scored from 0 to 40, cut-off at 9 
point) [10]. In Denmark, the risk proportion of 'all kinds 
of depression' was 7.1% using the Major Depression 
Inventory (MDI) (scored from 0 to 50, cut-off at 20 
points) [11]. 

In the present study, we evaluate the two mental health 
instruments the Major Depression Inventory (MDI) and 
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the Mental Health Inventory (MHI-5) in regard to the 
prediction of long-term sickness absence. If the mental 
health of a person causes him or her to have a long-term 
sickness absence period from work, it is a severe health 
problem. Furthermore, the cumulative cost of sickness ab- 
sence due to mental health problems is an economic bur- 
den for society. We therefore consider long-term sickness 
absence to be a relevant outcome for evaluating the per- 
formance of a mental health instrument used in popula- 
tion surveys. 

The two questionnaires, the MDI and the MHI-5, have 
different designs (see Table 1): the MDI questions are a 
list of specific symptoms of depression; the MHI-5 ques- 
tions are more generally formulated. Despite the differ- 
ences, both instruments have been validated as measures 
of depression [12,13] and both have been shown to pre- 
dict sickness absence [4,5]. In the present study we com- 
pare the instruments in the same study in a so-called 
head-to-head comparison, and we are therefore able to 
identify the best performing instrument in regard to the 
prediction of long-term sickness absence. We also divide 
the population into high risk and low risk groups by sev- 
eral methods for each instrument, and calculate the pre- 
diction of long-term sickness absence using different 
categorisations of the scales. 



Methods 

Study sample and design 

Data was collected as part of the DAnish National work- 
ing Environment Survey (DANES) from late 2008 until 
early 2009. The DANES was approved by the Danish 
Data Protection Agency, journal number: 2008-54-0553. 
According to Danish law, questionnaire and register 
based studies do not need approval by ethical and scien- 
tific committees. 

Questionnaire data was collected from a random sam- 
ple of the Danish employed population, age 18-59 years. 
Participants could answer the questionnaire either by 
internet or by post. Non-responders received two re- 
minders and were finally contacted and invited to par- 
ticipate in a telephone interview. The study contacted 
9913 persons, 6531 responded (66%), of which 4919 
were employees. We excluded participants with missing 
values on MHI-5 and MDI (n = 439), with missing re- 
sponse date (n = 150), with a sickness absence spell last- 
ing 4 or more weeks in the preceding 3 months before 
or overlapping baseline (n = 177), yielding a study sample 
of 4153 participants. In the multivariable adjusted analyses 
the sample was, because of missing values to covariates, 
further reduced to 3713 participants. The 3713 participants 
are 75% of the employees that returned the questionnaire. 



Table 1 The Mental Health Inventory (MHI-5) and the Major Depression Inventory (MDI) 


MHI-5 




Response categories 




How much of the time during the last 4 weeks, have you. . .. 




(1) 


Been a very nervous person? 


(a) All of the time 


(2) 


Felt so down in the dumps that nothing could cheer you up? 


(b) Most of the time 


(3) 


Felt calm and peaceful? 


(c) A good bit of the time 


(4) 


Felt downhearted and blue? 


(d) Some of the time 


(5) 


Been a happy person? 


(e) A little of the time 






( f) At no time 


MDI 




Response categories 




How much of the time in the last 2 weeks... 




(D 


Have you felt low in spirit or sad? 




(2) 


Have you lost interest in your daily activities? 




(3) 


Have you felt lacking in energy and strength? 




(4) 


Have you felt less self-confident? 


(a) All of the time 


(5) 


Have you had a bad conscience or feelings of guilt? 


(b) Most of the time 


(6) 


Have you felt that life wasn't worth living? 


(c) Slightly more than half of the time 


(7) 


Have you had difficulty in concentrating, e.g., when reading the newspaper or watching television? 


(d) Slightly less than half of the time 


(8a)* 


Have you felt very restless? 


(e) A little of the time 


(8b)* 


Have you felt subdued? 


( f) At no time 


(9) 


have you had trouble sleeping at night? 




(10a)* 


Have you suffered from reduced appetite? 




(10b)* 


Have you suffered from increased appetite? 





'*'ln the MDI is only the question with the highest response value of the questions 8a/8b and of the questions 1 0a/1 0b used in the total score. 
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If we assume the return rate of the employees is equal to 
the return rate of the full sample, i.e., 66%, the sample for 
the final analysis is 50% of the employed people the ques- 
tionnaire originally was mailed to. 

Our long-term sickness absence data came from a na- 
tional register, the DREAM register [14]. The DREAM 
register has information on all Danish social transfer 
payments on a weekly basis. Employers are entitled to 
compensation from the municipalities if an employee 
has an absence spell of 22 days or longer. 

The major depression inventory 

The Major Depression Inventory (MDI) was developed 
in the late 1990s [12]. It was designed to measure de- 
pression symptoms in accordance with the symptom 
guidelines defined by the WHO classification for uni- 
polar depression (ICD-10) and the American Psychiatric 
Association classification for major depression (DSM-IV) 
[12,15]. The instrument consists of 12 questions and it in- 
cludes 2 algorithms that classify participants with risk of 
unipolar depression (mild, moderate or severe) according 
to the ICD-10 definition or with risk of major depression 
according to the DSM-IV definition. The instruments can 
also be used to measure depressive symptoms scored on a 
scale ranging from 0 to 50, a higher score indicates a 
higher level of depressive symptoms. Both Bech et al. [12] 
and Forsell [16] have validated the MDI as a measure of 
depression using SCAN clinical interviews. 

The questions of the MDI are shown in Table 1. 

We calculated the MDI's predictive value for long- 
term sickness absence when we used the MDI as: (1) a 
standardized scale score, (2) the scale categorized into 
five ordinal levels [5], (3) dichotomized into likely de- 
pression or not by the cut-off >20 in accordance with 
previous studies [5,16], likely depression or not based on 
the DSM-IV algorithm, and likely depression or not 
based on the ICD-10 algorithm (we pooled the ICD-10 
categories mild, moderate and severe) [12]. 

The mental health inventory 

The mental health inventory (MHI-5) is a five-question 
subscale of the general health measure SF-36 [17]. The 
MHI-5 includes questions referring to both positive and 
negative aspects of mental health, and questions refer- 
ring to both depression and anxiety [18,19]. Berwick 
et al. [13], Cuijpers et al. [18] and Rumph et al. [20] have 
validated the MHI-5 as a measure for depression using 
clinical interviews as the gold standard. The MHI-5 has 
also been tested as a measure of anxiety, somatoform 
disorders, and substance use disorders, but has failed to 
perform equally well in these tests [18,20]. The questions 
of the MHI-5 are shown in Table 1. 

We scored the MHI-5 from 0 (poor mental health) to 
100 (good mental health) [21]. Several cut-offs for dividing 



the responders into high risk and low risk group have 
been suggested for the MHI-5. The cut-off points are de- 
rived from ROC-curves or through another minimum 
misclassification criterion. The most used cut-off point is 
probably 52 [4,22], other cut-off points include 54 and 74 
[18] and the cut-offs 60, 68 and 76 [23]. We calculated the 
predictive value for long-term sickness absence using the 
MHI-5 as: (1) a standardized scale score; (2) categorized 
into 5 ordinal levels; and (3) dichotomized by four differ- 
ent cut-offs, i.e. the cut-offs 52, 60, 68 and 76. 

Long-term sickness absence 

Our outcome was time until onset of first episode of 
long-term sickness absence. In our analysis, long-term 
sickness absence was defined as minimum 4 weeks of 
consecutive sickness absence, because 4 weeks are the 
minimum period that is registered within the weekly 
based DREAM register. We followed responders in the 
register from 12 weeks before response date and until 
60 weeks after response date. Participants, with long- 
term sickness absence in the 12 weeks before or overlap- 
ping the particular participant's response date, were ex- 
cluded from the study. Only sickness absence that 
comes after the response of the questionnaire is included 
in our analyses. 

Covariates 

The following covariates were included in the analyses: 
Age, gender, family status, smoking, alcohol, body mass 
index, leisure time physical activity, social class, somatic 
chronic illness, self-rated health and method of data col- 
lection. With the exception of age and gender, which 
were retrieved from registers, the information on the co- 
variates came from the questionnaire. 

We divided family status into four categories: (1) 
cohabitating, with children living at home, (2) cohabiting, 
without children living at home, (3) not cohabitating, with 
children living at home, (4) not cohabiting, without chil- 
dren living at home. Smoking was categorized into four 
levels; (1) never smoked, (2) ex-smoker, (3) light smoker, 
i.e., from occasional smoker to 14 cigarettes a day, and 
(4) heavy smoker >15 cigarettes a day. Alcohol intake was 
categorized into four levels following the guidelines of 
the Danish National Board of Health: (1) < 7 units weekly, 
(2) 8-14 units, (3) 15-21 units, (4) > 21 units weekly. Body 
mass index was categorized into 4 levels according to the 
WHO categorization: (1) < 18.5 (underweight), (2) 18.5-25 
(normal weight), (3) 25-30 (overweight), (4) > 30 (obese). 
Leisure time physical activity was scored on a scale from 0 
to 24 points. Social class was categorized in accordance 
with the European Socio-economic Classification (ESeC). 
Somatic chronic illness was constructed as a binary vari- 
able (somatic chronic illness or not). Self- rated health was 
scored on a scale from 1-5 points. The self-rated health 
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question primarily measures physical health [24,25] and is 
therefore used together with somatic chronic illness to 
control for the responders physical health. Method of data 
collection was included, since responses are influenced by 
the data collection method [26]. Family status and social 
status were used as categorical variables in the analyses. All 
other covariates were assumed to be ordinal or linear. 

Statistical analyses 

We compared the MDI and MHI-5 in a head-to-head 
design [27]. We calculated the Pearson correlation be- 
tween the MDI and the MHI-5 and we compared the 
distribution of MDI and the MHI-5 in two plots. We 
calculated the predictive value for long-term sickness ab- 
sence by Cox's proportional hazard in both a univariate 
regression analysis and a multivariate regression analysis. 
Participants were censored at the date of emigration, 
death, retirement, maternity leave or end of follow-up 
(60 weeks after baseline), whichever came first. We ex- 
amined the proportional hazard assumption by visual in- 
spection of log-log-survival plots. The predictive value of 
MDI and MHI-5 for long-term sickness absence was cal- 
culated with (1) measures dichotomized by different cut- 
offs, (2) 5-level categorical indexes, and (3) standardized 
scores. We standardized the two scores of the MDI 
and MHI-5 using mean = 0 and standard deviation = 1. 
The standardized MHI-5 score was reversed so high 
score equalled poor mental health for both scores. In 
order to evaluate which mental health instrument that 
had the best prediction of long-term sickness absence, 
we included both standardized scores in the same re- 
gression analysis. If one measure in such an analysis 
have a substantial better significance level than the 
other measure, it can be concluded that this measure 
better capture the variation in the data. To further 
confirm the result from this analysis, we performed a 
cross-validation by randomly splitting our sample into 
two independent samples and then carry out the ana- 
lysis in each sample. 

Results 

The MDI 

The MDI standardized score, the five-level categorized 
score, and the three dichotomized scores all predicted 
long-term sickness absence (see Table 2). The standard- 
ized score had the highest significance level. The 5-level 
categorization of MDI showed a steep increase in HR 
from the next highest to the highest level. This suggests 
that the best cut-off point for a dichotomization into a 
risk group and non-risk group is near this score, i.e., 
near 20 points (risk group = 7%). The risk group in- 
cluded 90 to 301 persons depending on which dichoto- 
mization we used, i.e., the risk group was from 2 to 7% 
of the employed adult population (see Table 2). 



The MHI-5 

The MHI-5 standardized score, the five-level categorized 
score, and the four dichotomized scores all predicted 
long-term sickness absence (see Table 2). The standard- 
ized score had the highest significance level. The 5-level 
categorization of MHI-5 showed an increase in HR for 
each level. The increase between the next highest and 
the highest level of the categorization was particular 
steep. This suggests that the best cut-off point for a di- 
chotomization is near 60 points (risk group = 14%). The 
risk group included from 304 to 1560 persons depending 
on which dichotomization we used, i.e., from 7 to 38% 
of the employed adult population. 

MDI and MHI-5 compared 

The HRs of the standardized scores estimate the in- 
creased risk for one standard deviation increase of the 
standardized score, i.e., the increased risk of sickness ab- 
sence for a one standard deviation of worse mental 
health. In the adjusted analyses, the HR of MDI was 1.27 
and the HR of MHI-5 was 1.37, both were highly sig- 
nificant (see Table 2). With both instruments included 
in the same regression analysis, the MHI-5 remained 
significant and the MDI became non-significant (see 
Table 2), i.e., the MHI-5 explained a significant part 
of the variation in long-term sickness absence even 
when we controlled for the MDI. To confirm the sta- 
bility of this result, we cross-validated the result by 
randomly splitting our sample in two independent 
samples and then carry out the analysis in each sam- 
ple. Both results supported our initial finding (results 
not shown). Thus, the MHI-5 explained the variation 
in long-term sickness absence better than the MDI. 

Figure 1 shows the distribution of MDI and MHI-5 
scores within the study population. Both distributions 
are positively skewed, indicating that they distinguish 
better between degrees of reduced mental health than 
degrees of good mental health. The MDI score corre- 
lated with the MHI-5 score with a Pearson correlation 
coefficient = -0.69 and p < 0.0001. The correlation is il- 
lustrated in Figure 2. The Figure also illustrates that not 
all participants classified as high risk by the MDI were 
classified as high risk by the MHI-5 or vice versa. The 
MDI 'cut-off 20' and the MHI-5 'cut-off 52' each classi- 
fied 7.3% as high risk. However, only 3.8% of the 'cases' 
overlapped. 

Discussion 

Both the standardized MDI score and the standardized 
MHI-5 score strongly predicted long-term sickness ab- 
sence. However, the MHI-5 captured more of the vari- 
ability than the MDI when both instruments were 
included in the same analysis. The MHI-5 therefore per- 
forms better than the MDI as a predictor of long-term 
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Table 2 The MDi's and the MHI-5's predictive value for long-term sickness absence 







Univariate 




Adjusted '*' 






N 


HR [CI 95] 


p-value 


HR [CI 95] 


p-value 


MDI 


4153 


144 [1.31; 1.59] 


3.31 E-14 


1.27 [1.12; 1.43] 


0.0001 


MHI-5 


4153 


1.50 [1.35; 1.66] 


9E-15 


1.37 [1.21; 1.55] 


5.57E-07 


MDI (with MHI-5 as covariate) 








1 .07 [0.92; 1 .26] 


037 


MHI-5 (with MDI as covariate) 








1.31 [1.11; 1.54] 


0.001 


MDI dichotomized 












MDI cutpoint 20 depression (yes/no) 


301 


2.79 [2.00; 3.88] 


1.5E-09 


1.98 [1.36; 2.89] 


0.0004 


MDI ICD-10 depression (yes/no) 


113 


3.58 [2.29; 5.60] 


2.28E-08 


2.08 [1.26; 3.43] 


0.004 


MDI DSM-IV depression (yes/no) 


90 


3.40 [2.05; 5.64] 


2.23E-06 


1.92 [1.08; 3.41] 


0.03 


MDI categorica 












MDI = 0-4 (reference level) 


1591 


1 




1 




MDI = 5-9 


1456 


1.55 [1.11; 2.17] 




1 42 [0.99; 2.03] 




MDI = 10-14 


515 


2.33 [1.57; 345] 




1.60 [1.03; 2.50] 




MDI = 15-19 


290 


2.05 [1.25; 3.38] 




1 .38 [0.80; 2.38] 




MDI > =20 


301 


4.08 [2.74; 6.07] 




2.68 [1.68; 4.27] 




(p-value for all 5 levels) 






8.52E-1 1 




0.001 


MHI-5 dichotomized 












MHI cutpoint 52 mental ill (yes/no) 


304 


2.58 [1 .84; 3.62] 


4.32E-08 


1.92 [1.32; 2.81] 


0.0007 


MHI cutpoint 60 mental ill (yes/no) 


595 


2.85 [2.17; 3.73] 


2.82E-14 


2.20 [1.62; 2.99] 


4.83E-07 


MHI cutpoint 68 mental ill (yes/no) 


978 


2.44 [1.90; 3.14] 


4.82E-12 


1 .99 [1 .49; 2.66] 


2.83 E-06 


MHI cutpoint 76 mental ill (yes/no) 


1560 


2.05 [1 .59; 2.64] 


2.14E-08 


1.57 [1.18; 2.09] 


0.002 


MHI-5 categorical 












MHI = 91-100 (reference level) 


1104 


1 




1 




MHI = 81-90 


1093 


1 .02 [0.67; 1 .54] 




1.01 [0.65; 1.57] 




MHI = 71-80 


978 


1 .39 [0.94; 2.07] 




1.21 [0.78; 1.86] 




MHI =61-70 


383 


1.77 [1.10; 2.87] 




1.57 [0.94; 2.61] 




MHI < 60 


595 


3.41 [2.35; 4.94] 




2.56 [1.67; 3.92] 




(p-value for all 5 levels) 






5.75E-13 




9.09E-06 


'*'The adjusted analyses include the covariates: sex, 


age, family status, smoking, alcohol, body mass 


index, leisure time phys 


cal activity, social class. 


somatic 



chronic illness, self-rated health and data collection method. 
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Figure 2 The association between MDI and MHI-5. Bubble size indicates no. of persons with the particular response pattern. The MDI cut-off 
at 20 and the MHI-5 cut-off at 52 are marked with a line, the resulting cross-hair illustrates that 'cases' identified by MDI and MHI-5 is only 
partly overlapping. 



sickness absence. Furthermore, the MHI-5 consists of 
fewer questions than the MDI and is therefore more 
'economic'. Our study can, however, not conclude that 
MHI-5 is a better measure of depression than the MDI; 
it can only conclude that the predictive value for long- 
term sickness absence is higher for the MHI-5 than for 
the MDI. 

The size estimations of the high risk group from the 
different dichotomizations of the MDI (2 to 7%) were 
more restrictive than the size estimations of the high risk 
group from the different dichotomizations of the MHI-5 
(7 to 38%). If the MDI only identified people with risk of 
depression and the MHI-5 identified people with re- 
duced mental health in general, the MDI should indeed 
identify less people than the MHI-5. This could also 
explain why the MHI-5 is a better predictor of sick- 
ness absence than the MDI. However, studies show 
that the MHI-5 performs better as a measure of de- 
pression than as a measure of anxiety or substance 
disorder [13,18,20,28]. It is possible that the MHI-5 is 
more general than the MDI but the MHI-5 is not 
general enough to show strong validity as a measure 
of other disorders than mood disorder. 

We chose to compare the predictive validity (i.e., the 
ability to predict a relevant outcome) of the two instru- 
ments. A limitation of our study is that only one out- 
come was tested, time until long-term sickness absence. 
Other outcomes, such as number of sick days or prod- 
uctivity at the workplace, may have given a different result. 
The population, in which we test the instruments, may also 
be of importance for the result. The MDI might perform 
better than the MHI-5 in a high risk population. The 
population in our study is a general working population, 
and only approximately 50% of those the questionnaire 
was aimed at ended up in the final regression analyses. 
The final sample may have been particular healthy due to 



selection bias. Furthermore, long-term sickness absence 
can be caused by other reasons than reduced mental 
health. It could, as an example, be due to a somatic illness. 
We have tried to control for physical health in the analyses, 
but it may not be sufficient. 

Conclusion 

The MHI-5 had a higher predictive value for long-term 
sickness absence than the MDI. In a study where the 
predictive value for long-term sickness absence is of im- 
portance the MHI-5 must be recommended as the best 
measure of mental health. The size of the high risk 
group can for the same instrument be very different de- 
pending on the choice of cut-off for case-ness, however, 
the MDI categorized in general fewer persons as 'cases' 
than the MHI-5. 
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